Preface

statistics


Computational Musicology

In short, computational musicology is the use of computational and statistical techniques to study musical systems (notes, chords, rhythms, and patterns thereof); it is more or less synonymous with computational music theory. Because it combines computer science, statistics, and information, it is a type of data science. Yet despite this scientific dimension, computational musicology sits firmly within the modern humanities and reflects on the same issues as the conventional humanities. Corpus studies are probably the most common form of computational musicology project. A corpus study uses algorithms to find statistical patterns across a large collection of musical works.

For this report I will look into the film composer and producer Ludwig Göransson. I collected Spotify data on nine of his recent film scores and combined all the results in the correlogram above. Because this dataset is very large, I have to zoom in on some of the playlists and variables. I am curious whether I can find meaningful relationships between the variables. As you can already see, there are two notable negative correlations involving the variable acousticness. In the following pages I will analyse what these relationships indicate and show them in various ways.
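To make the idea behind the correlogram concrete, here is a minimal sketch of how a pairwise correlation matrix can be computed. It assumes Pearson correlation, which may differ from the measure used for the plot above, and the numbers are toy values invented for illustration, not the Spotify data. The function names `pearson` and `correlation_matrix` are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def correlation_matrix(features):
    """Pairwise correlations for a dict mapping feature name -> list of values."""
    names = list(features)
    return {a: {b: pearson(features[a], features[b]) for b in names} for a in names}

# Toy values, invented purely for illustration (NOT taken from the Spotify data).
toy = {
    "acousticness": [0.95, 0.80, 0.47, 0.10, 0.03],
    "energy":       [0.10, 0.25, 0.40, 0.70, 0.90],
    "loudness":     [-25.0, -20.0, -15.0, -9.0, -6.0],
}

matrix = correlation_matrix(toy)
# A negative value means the two variables tend to move in opposite directions.
print(round(matrix["acousticness"]["energy"], 2))
```

Each cell of a correlogram is one such coefficient, drawn as a colour or a circle size.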

Significance?

Acousticness? So what actually is acousticness? It is fairly self-explanatory: it describes how acoustic a track is, on a scale from 0 to 1. I believe Göransson is interesting to study because he combines acoustic and electronic sounds so well. They are well balanced, as you can see from the values for Tenet, which is also his latest score. Fruitvale Station, by contrast, has a much higher mean acousticness.


Analysis

Main


A first step towards understanding a given signal better is to decompose it into building blocks that are easier to handle in the subsequent processing steps. If these blocks consist of sinusoidal functions, the method is usually referred to as Fourier analysis. Sinusoidal functions are special because they have an explicit physical meaning in terms of frequency. The resulting decomposition unfolds the frequency spectrum of a signal, much like a prism that splits light into its constituent spectral colours. The Fourier transform converts a time-dependent signal into a frequency representation. As one of the main tools in signal processing, the Fourier transform will come up in a number of music processing tasks.
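The decomposition described above can be sketched with a small discrete Fourier transform. This is a deliberately naive implementation in plain Python, written for clarity rather than speed; the sample rate and the two test frequencies are arbitrary values chosen for illustration.

```python
import cmath
import math

def dft_magnitudes(signal):
    """Magnitudes of the discrete Fourier transform (naive O(N^2) version)."""
    n = len(signal)
    mags = []
    for k in range(n // 2 + 1):  # non-negative frequencies only
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(signal))
        mags.append(abs(coeff))
    return mags

sample_rate = 64  # samples per second (hypothetical, kept small for speed)
times = [t / sample_rate for t in range(sample_rate)]  # one second of signal

# Mix of a strong 5 Hz sinusoid and a weaker 12 Hz sinusoid.
signal = [math.sin(2 * math.pi * 5 * t) + 0.5 * math.sin(2 * math.pi * 12 * t)
          for t in times]

mags = dft_magnitudes(signal)
# With exactly one second of signal, bin k corresponds to k Hz.
strongest = sorted(range(len(mags)), key=lambda k: mags[k], reverse=True)[:2]
print(sorted(strongest))
```

The two largest magnitudes sit at the bins of the two sinusoids that were mixed in, which is the "unfolding" of the frequency content that the text describes.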

Looking at acousticness again, you will see a significant difference between Fruitvale Station (Romance) and Tenet (Action). I used this chart to pick out certain tracks that will be analysed in more depth later on.

Introduction


As you can see, Black Panther has some really interesting findings. For example, “Killmonger challenge” is one of the loudest tracks (as seen in the point size), and “Killmonger” has one of the highest instrumentalness values. Another interesting finding is the track “Rainy night in Tallinn” from Tenet. This track has a really high acousticness, high energy and a tempo of 130 BPM. The findings above are on the “action/superhero” side of the spectrum, and I want to compare them with tracks from the opposite genre: “Romance”.

Sci-fi or action?

Ludde

Ludwig Ludwig


chorddiagramm


Chroma

Chroma


From top to bottom:

From Mumbai to Amalfi - Acousticness: 0.9470
Retrieving the case - Acousticness: 0.4650
Trucks in place - Acousticness: 0.0328

Human pitch perception is periodic in the sense that two pitches are perceived as similar in “color” (playing a similar harmonic role) if they differ by one or more octaves. Based on this observation, a pitch can be divided into two components, known as tone height and chroma. The tone height corresponds to the octave number and the chroma to the pitch attribute within the octave. We number the chroma values in [0:11], where 0 refers to chroma C, 1 to C♯, and so on. A pitch class is the set of all pitches that share the same chroma. For example, the pitch class for chroma C consists of all C pitches, which are separated by a whole number of octaves. For convenience, we use the terms chroma and pitch class interchangeably. By summing all the pitch coefficients that belong to the same chroma, a chroma representation or chromagram can be derived.
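The summing step can be sketched as follows. The sketch assumes that pitches are given as MIDI note numbers (60 = C4), which is a common convention but not something stated above, and the energies in the example frame are made up. The helper names are hypothetical.

```python
CHROMA_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def chroma_of(midi_pitch):
    """Chroma index in [0:11]; 0 is C, 1 is C#, and so on."""
    return midi_pitch % 12

def tone_height_of(midi_pitch):
    """Octave number, assuming the MIDI convention in which note 60 is C4."""
    return midi_pitch // 12 - 1

def chroma_vector(pitch_energy):
    """Sum the energy of all pitches that share a chroma.

    pitch_energy maps a MIDI pitch number to its energy in one frame.
    """
    chroma = [0.0] * 12
    for pitch, energy in pitch_energy.items():
        chroma[chroma_of(pitch)] += energy
    return chroma

# Hypothetical frame containing C3, C4, E4 and G4 (energies are invented).
frame = {48: 0.5, 60: 1.0, 64: 0.8, 67: 0.6}
vec = chroma_vector(frame)
for name, value in zip(CHROMA_NAMES, vec):
    if value > 0:
        print(name, value)
```

The two C notes from different octaves end up in the same chroma bin, which is exactly the octave equivalence the paragraph describes. A chromagram is this vector computed for every frame of a recording.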

For the chroma analysis I picked one track with a low acousticness (around 0), one with a high acousticness (around 1), and one with an average acousticness (around 0.5). I wanted to find out whether there were any interesting results in terms of chroma features.

Dynamic Time Warping


Dynamic time warping (DTW)

We saw how different music representations can be made comparable by transforming them into suitable feature representations. Next, we investigate how these features can be aligned or synchronized in time. To this end, we introduce an important technique known as dynamic time warping (DTW). DTW compares two given sequences. In our music synchronization scenario, these two sequences could for instance be two different versions of the same piece of music.
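As a rough illustration of the alignment idea, here is a minimal sketch of classic DTW on two short numeric sequences. It assumes the absolute difference as local cost and the three standard step directions; real music synchronization would work on feature vectors such as chroma frames instead of single numbers.

```python
def dtw(seq_a, seq_b):
    """Return (total cost, warping path) for aligning two numeric sequences."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # acc[i][j] = minimal accumulated cost of aligning seq_a[:i] with seq_b[:j]
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(seq_a[i - 1] - seq_b[j - 1])
            acc[i][j] = cost + min(acc[i - 1][j],      # step in seq_a only
                                   acc[i][j - 1],      # step in seq_b only
                                   acc[i - 1][j - 1])  # step in both
    # Backtrack from the end to recover which elements were matched.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [(acc[i - 1][j - 1], i - 1, j - 1),
                      (acc[i - 1][j], i - 1, j),
                      (acc[i][j - 1], i, j - 1)]
        _, i, j = min(candidates)
    path.reverse()
    return acc[n][m], path

# Two "performances" of the same melody; the second one lingers on some notes.
fast = [1, 2, 3, 4]
slow = [1, 1, 2, 3, 3, 4]
cost, path = dtw(fast, slow)
print(cost, path)
```

The warping path pairs each index of the first sequence with one or more indices of the second, so a note that is held longer in one version is matched several times.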

Saturation

Cepstrograms


Analysis of Music Structure

One of the characteristics that separates music from random sound sources is the hierarchical structure in which it is organized. At the lowest level, there are events such as individual notes, characterized by their pitch, timbre, timing and duration. When different sound events are combined, larger structures such as motifs, phrases and sections are obtained, and these structures in turn form even larger structures which define the overall layout of the composition. This higher level of structure is often referred to as the musical structure of the work, which is described in terms of musical sections and their relations to one another. In popular music, for instance, these sections can be the intro, the chorus and the verse of a song. In classical music, they can be the exposition, the development and the recapitulation of a movement. The general purpose of music structure analysis is to divide a given music representation into temporal segments that correspond to musical sections.

“You know. Yeah. Time to wake up”

“This shit is about to go down.”

I will analyse the track “RAINY NIGHT IN TALLINN”, which in my opinion is one of his most interesting pieces. I think the build-up is done perfectly and the tension is huge. I also like the timbre of the sound, because it is hard to tell whether he used acoustic or electronic instruments.

In the beginning, you can hear guitars and strings. It is just two notes, but the movement and phrasing of these two notes are intriguing because it is difficult to say whether they are played forwards or backwards, or have been reversed.

After that comes the middle part of the track, which plays when the actor sees other soldiers running towards the house with gas tanks.

There are a lot of elements going on here, and you cannot really tell what each sound is, but it is actually guitars again.

They almost sound like alarms.

“It’s very effective. And it sounds like it’s like almost like a, you know, what do you have? Like when you when you try to wake you up in the morning like a clock, you know. Yeah. Time to wake up. But I guess it’s telling the audience, hey, guys, time to wake. This shit is about to go down.”

A major contrast can be seen between this track with its very low acousticness (third plot) and the track “Dinner time” (second plot), which has a very high acousticness. The album Tenet contains a variety of transitions and stories and has the perfect mix of acoustic instruments and electronic modular synthesizers. I believe the style reflects Ludwig’s development quite well. “Rainy night in Tallinn” sounds very enigmatic and mysterious to me. It was also produced during the pandemic, which perhaps had some influence on it as well.

Self-Similarity Matrices


The concepts of repetition, homogeneity, and novelty are fundamental for segmenting a given audio recording into musically meaningful structural parts. One general way of studying musical structures and their mutual relations is to convert the signal into a sequence of features and then compare every element of that sequence with every other element. This results in a self-similarity matrix (SSM), a tool which is of fundamental importance not just for music structure analysis but also for the analysis of many other kinds of time series.
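A minimal sketch of the construction described above, assuming cosine similarity as the similarity measure (other measures are possible) and a toy feature sequence invented to mimic an A-B-A form:

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two feature vectors (0.0 if one is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0

def self_similarity_matrix(features):
    """Compare every feature vector in the sequence with every other one."""
    return [[cosine_similarity(u, v) for v in features] for u in features]

# Toy feature sequence with an A-B-A form (the vectors are invented).
section_a = [1.0, 0.0, 0.0]
section_b = [0.0, 1.0, 0.0]
features = [section_a, section_a, section_b, section_b, section_a, section_a]

ssm = self_similarity_matrix(features)
for row in ssm:
    print(" ".join(f"{value:.0f}" for value in row))
```

Homogeneous sections show up as bright blocks along the main diagonal, and the return of section A shows up as a bright block away from the diagonal.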

Keys

Where are my keys?


From top to bottom:

Picking up T - Acousticness: 0.9920
Emi - Acousticness: 0.9910

It’s your time - Acousticness: 0.5260
Killmonger vs Challa - Acousticness: 0.5250

Trucks in place - Acousticness: 0.0328
Fast cars - Acousticness: 0.0636

Harmony refers to the simultaneous sounding of different notes, which form a coherent whole in the listener’s mind. The principal elements of harmony, at least in the Western music tradition, are chords: musical constructs that usually consist of three or more pitches. The study of harmony may be regarded as the study of how chords are built, how they connect and how they progress. The progression of chords over time corresponds closely to what is often called the harmonic content of a piece of music. These progressions are important for composing, describing and interpreting Western tonal music such as popular, jazz and classical music. Features that capture harmonic information are therefore commonly used in music processing tasks, including music structure analysis and music retrieval.

Typical chord recognition systems consist of two main steps. The first step consists of cutting the recording into frames and transforming each frame into a suitable feature vector. Most recognition systems use chroma-based audio features, which relate to the tonal information in the audio signal. In the second step, pattern matching techniques are used to compare each feature vector with a set of pre-defined chord models. The best match determines the chord label assigned to the frame. Additional enhancements are applied either before the pattern matching step (referred to as prefiltering) or during or after it (referred to as postfiltering) in order to improve the chord recognition results.
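The second step, matching a chroma vector against chord models, can be sketched with simple binary templates. This is a simplified illustration under stated assumptions: only the 24 major and minor triads are modelled, cosine similarity is used for matching, no pre- or postfiltering is applied, and the example chroma vectors are invented. It is not necessarily the method behind the plots in this portfolio.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def build_templates():
    """Binary chroma templates for the 12 major and 12 minor triads."""
    templates = {}
    for root in range(12):
        for suffix, third in (("", 4), ("m", 3)):  # major third = 4 semitones
            template = [0.0] * 12
            for interval in (0, third, 7):         # root, third, fifth
                template[(root + interval) % 12] = 1.0
            templates[NOTE_NAMES[root] + suffix] = template
    return templates

def cosine_similarity(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0

def recognise_chord(chroma, templates):
    """Return the label of the template that best matches the chroma vector."""
    return max(templates, key=lambda name: cosine_similarity(chroma, templates[name]))

templates = build_templates()

# Invented chroma frames: mostly C, E, G with a little A, and mostly A, C, E.
c_major_like = [1.0, 0, 0, 0, 0.8, 0, 0, 0.9, 0, 0.1, 0, 0]
a_minor_like = [0.9, 0, 0, 0, 0.8, 0, 0, 0, 0, 1.0, 0, 0]

print(recognise_chord(c_major_like, templates))
print(recognise_chord(a_minor_like, templates))
```

Running this frame by frame over a chromagram gives one chord label per frame, which is the raw output that filtering steps would then smooth.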

For the chord estimation I picked two tracks with a low acousticness (around 0), two with a high acousticness (around 1), and two with an average acousticness (around 0.5). I wanted to see whether there were any interesting results in terms of key features.

Presto

Wakanda


Energy-based novelty
Playing a note on an instrument often coincides with a sudden rise in signal energy, for instance when you strike a key on a piano, pluck a string on a guitar or hit a drum with a stick. On this basis, a straightforward way of detecting note onsets is to transform the signal into an energy function that indicates the local energy of the signal at each moment, and then to look for abrupt changes in that function.
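A minimal sketch of this idea, assuming non-overlapping windows and a simple difference between neighbouring windows (practical systems usually add smoothing and logarithmic compression). The signal is a toy example: silence followed by a burst that decays.

```python
def local_energy(signal, window):
    """Energy (sum of squares) in consecutive non-overlapping windows."""
    return [sum(x * x for x in signal[start:start + window])
            for start in range(0, len(signal) - window + 1, window)]

def energy_novelty(signal, window):
    """Energy increase from each window to the next; decreases are set to zero."""
    energy = local_energy(signal, window)
    return [max(energy[i + 1] - energy[i], 0.0) for i in range(len(energy) - 1)]

# Toy signal: 40 samples of silence, a loud burst, a quieter tail, silence again.
signal = [0.0] * 40 + [1.0, -1.0] * 10 + [0.5, -0.5] * 10 + [0.0] * 20

novelty = energy_novelty(signal, window=10)
# novelty[i] describes the change from window i to window i + 1.
onset_window = max(range(len(novelty)), key=lambda i: novelty[i])
print(onset_window, novelty)
```

Keeping only the increases (half-wave rectification) means that the decay of the note does not produce a peak, only its start does.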

Spectral-based novelty
Onset detection becomes much more difficult for polyphonic music, where sound events overlap. A musical event of low intensity may be masked by an event of high intensity. Energy fluctuations in the sustain phase of one instrument can be larger than the energy increase in the attack phase of another instrument. It is therefore generally hard to detect all onsets with purely energy-based techniques when several instruments play at the same time. The characteristics of a note onset, however, can depend heavily on the type of instrument. For percussive instruments with an impulse-like onset, for example, one can observe a sudden increase in energy that is spread over the whole frequency spectrum. Such noise-like broadband transients can be observed in certain frequency bands even in polyphonic mixes. Moreover, since the energy of harmonic sources is concentrated in the lower part of the spectrum, transients are often well detectable in the higher-frequency region.
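The following sketch illustrates why looking at the spectrum helps. It computes a simple spectral novelty function (often called spectral flux): a magnitude spectrum per frame, logarithmic compression, then the summed positive differences between neighbouring frames. The frame size, the compression constant `gamma` and the toy signal are assumptions chosen for illustration; the DFT is a naive one to keep the example dependency-free.

```python
import cmath
import math

def magnitude_spectrum(frame):
    """Magnitudes of the non-negative DFT bins of one frame (naive DFT)."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame)))
            for k in range(n // 2 + 1)]

def spectral_novelty(signal, frame_size, gamma=10.0):
    """Sum of positive changes in the log-compressed spectrum between frames."""
    frames = [signal[s:s + frame_size]
              for s in range(0, len(signal) - frame_size + 1, frame_size)]
    spectra = [[math.log(1.0 + gamma * m) for m in magnitude_spectrum(f)]
               for f in frames]
    return [sum(max(c - p, 0.0) for p, c in zip(prev, cur))
            for prev, cur in zip(spectra, spectra[1:])]

frame_size = 32
n_samples = 8 * frame_size
# A loud, sustained low tone plays throughout ...
low = [math.sin(2 * math.pi * 2 * t / frame_size) for t in range(n_samples)]
# ... and a much quieter high tone enters at the start of the fifth frame.
high = [0.1 * math.sin(2 * math.pi * 12 * t / frame_size) if t >= 4 * frame_size else 0.0
        for t in range(n_samples)]
signal = [a + b for a, b in zip(low, high)]

novelty = spectral_novelty(signal, frame_size)
# novelty[i] describes the change from frame i to frame i + 1.
onset_frame = max(range(len(novelty)), key=lambda i: novelty[i])
print(onset_frame)
```

In this toy signal the quiet tone adds only about one percent to the energy of each frame, so an energy-based function barely reacts, while the spectral novelty shows a clear peak because the change happens in a frequency band that was empty before.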

For tempo I have chosen the track “Killmonger” from the Black Panther score. I think this is an interesting track to analyse because of the different layers it contains. In the beginning you can hear a flute, which represents Killmonger’s African background. The flute is pitched down and escalates into chaos. The next layer is the strings, which go up in arpeggios. These escalate and grow bigger. Then suddenly the music cuts out and 808 drums start to kick. They feel dangerous and come out of nowhere. They have a really low bass and sound like heartbeats. The last layer is the trap beat, representing him coming from Oakland.

Clutch

Classification

The acousticness of tracks from Göransson’s scores was analysed in the previous pages. This raises the question: can a classification algorithm be built to find out whether acousticness is a strong choice of variable?

To do this I used random forests. Random forests are based on a simple but powerful idea: the wisdom of crowds. In data science terms, the reason why the random forest model works is:

“A large number of relatively uncorrelated models (trees) working as a group will outperform any of the individual models.”
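The quoted idea can be checked with a little arithmetic. The sketch below is not a random forest; it only computes how often a majority vote is right if every model is correct with the same probability and the models are fully independent. That is a stronger assumption than "relatively uncorrelated", so real forests gain less than this idealised calculation suggests. The function name and the accuracy of 0.6 are hypothetical.

```python
from math import comb

def majority_vote_accuracy(n_models, p_correct):
    """Probability that a strict majority of n independent models is right.

    Assumes a two-class problem, an odd number of models, and that each
    model is correct with probability p_correct independently of the others.
    """
    needed = n_models // 2 + 1
    return sum(comb(n_models, k) * p_correct ** k * (1 - p_correct) ** (n_models - k)
               for k in range(needed, n_models + 1))

# How does the group do if every single model is right 60% of the time?
for n in (1, 11, 101):
    print(n, round(majority_vote_accuracy(n, 0.6), 3))
```

With one model the group is exactly as good as that model, and the accuracy of the vote rises as more independent models are added.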

Classification is a predictive modelling problem in machine learning in which a class label is predicted for a given example of input data.

For the second plot I used the k-nearest neighbours (KNN) algorithm. KNN is a non-parametric learning algorithm. Its aim is to use a database in which the data points are divided into several classes in order to predict the class of a new sample point.
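A minimal sketch of KNN classification, assuming Euclidean distance and a simple majority vote among the k closest points. The two features, the labels and all values are invented for illustration and do not come from the dataset used in this portfolio.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Label a query point by majority vote among its k nearest training points."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]

# Invented training data: (acousticness, energy) pairs with a made-up class label.
train_points = [(0.95, 0.10), (0.90, 0.20), (0.85, 0.15),
                (0.05, 0.90), (0.10, 0.80), (0.03, 0.85)]
train_labels = ["acoustic", "acoustic", "acoustic",
                "electronic", "electronic", "electronic"]

print(knn_predict(train_points, train_labels, (0.80, 0.20)))
print(knn_predict(train_points, train_labels, (0.10, 0.70)))
```

Because the algorithm only stores the training points and compares distances, it makes no assumption about the shape of the classes, which is what "non-parametric" refers to.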